首页> 外文OA文献 >Fidelity-based probabilistic Q-learning for control of quantum systems
【2h】

Fidelity-based probabilistic Q-learning for control of quantum systems

机译:基于保真度的概率Q学习控制量子系统

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The balance between exploration and exploitation is a key problem for reinforcement learning methods, especially for Q-learning. In this paper, a fidelity-based probabilistic Q-learning (FPQL) approach is presented to naturally solve this problem and applied for learning control of quantum systems. In this approach, fidelity is adopted to help direct the learning process and the probability of each action to be selected at a certain state is updated iteratively along with the learning process, which leads to a natural exploration strategy instead of a pointed one with configured parameters. A probabilistic Q-learning (PQL) algorithm is first presented to demonstrate the basic idea of probabilistic action selection. Then the FPQL algorithm is presented for learning control of quantum systems. Two examples (a spin-1/2 system and a Λ-type atomic system) are demonstrated to test the performance of the FPQL algorithm. The results show that FPQL algorithms attain a better balance between exploration and exploitation, and can also avoid local optimal policies and accelerate the learning process. © 2012 IEEE.
机译:探索与开发之间的平衡是强化学习方法(尤其是Q学习)的关键问题。本文提出了一种基于保真度的概率Q学习(FPQL)方法来自然地解决该问题,并将其应用于量子系统的学习控制。在这种方法中,采用保真度来帮助指导学习过程,并在学习过程中迭代更新每个动作在特定状态下被选择的概率,这导致了自然的探索策略,而不是带有配置参数的针对性策略。首先提出了一种概率Q学习(PQL)算法,以证明概率动作选择的基本思想。然后提出了用于量子系统学习控制的FPQL算法。演示了两个示例(自旋1/2系统和Λ型原子系统)来测试FPQL算法的性能。结果表明,FPQL算法在探索与开发之间达到了较好的平衡,并且可以避免局部最优策略,加快了学习过程。 ©2012 IEEE。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号